Towards the integration of automatic speech recognition and information retrieval for spoken query processing
نویسندگان
چکیده
Spoken query processing (SQP) is the task of fulfilling an information need, inferred from a spoken query, by listing a set of ranked relevant documents. The two main sources of uncertainty in SQP lay on the realization of the speech waveform and on the realization of the observed document. The proposed integration models these uncertainties under a single probabilistic framework. A case study on movie title retrieval by voice is presented to illustrate the proposed methodology. By allowing an ontology inlet, a 14% relative gain in the model convergence was achieved. An improved mean reciprocal rank and mean inclusion rate of the retrieval outcome was obtained using the proposed framework.
منابع مشابه
Towards robust methods for spoken document retrieval
In this paper, we investigate a number of robust indexing and retrieval methods in an effort to improve spoken document retrieval performance in the presence of speech recognition errors. In particular, we examine expanding the original query representation to include confusible terms; developing a new document-query retrieval measure based on approximate matching that is less sensitive to reco...
متن کاملDesigning and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملTest Collections for Spoken Document Retrieval from Lecture Audio Data
The Spoken Document Processing Working Group, which is part of the special interest group of spoken language processing of the Information Processing Society of Japan, is developing a test collection for evaluation of spoken document retrieval systems. A prototype of the test collection consists of a set of textual queries, relevant segment lists, and transcriptions by an automatic speech recog...
متن کاملSpoken query processing for interactive information retrieval
It has long been recognised that interactivity improves the effectiveness of Information Retrieval systems. Speech is the most natural and interactive medium of communication and recent progress in speech recognition is making it possible to build systems that interact with the user via speech. However, given the typical length of queries submitted to Information Retrieval systems, it is easy t...
متن کامل